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Abstract 

o 

We study the following vertex-weighted online bipartite matching problem: G{U, V, E) is a 
bipartite graph. The vertices in U have weights and are known ahead of time, while the vertices 
in V arrive online in an arbitrary order and have to be matched upon arrival. The goal is to 
maximize the sum of weights of the matched vertices in U. When all the weights are equal, this 
reduces to the classic online bipartite matching problem for which Karp, Vazirani and Vazirani 
gave an optimal (l — -) -competitive algorithm in their seminal work |KVV90| . 

Our main result is an optimal (l — -J -competitive randomized algorithm for general vertex 
weights. We use random perturbations of weights by appropriately chosen multiplicative factors. 
Our solution constitutes the first known generalization of the algorithm in [KVV90] in this model 
O . and provides new insights into the role of randomization in online allocation problems. It also 

effectively solves the problem of online budgeted allocations [MSVV05] in the case when an 
agent makes the same bid for any desired item, even if the bid is comparable to his budget - 
complementing the results of [MSVV05, BJN07] which apply when the bids are much smaller 
than the budgets. 
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1 Introduction 

Online bipartite matching is a fundamental problem with numerous applications such as matching 
candidates to jobs, ads to advertisers, or boys to girls. A canonical result in online bipartite 
matching is due to Karp, Vazirani and Vazirani |KVV90| . who gave an optimal online algorithm for 
the unweighted case to maximize the size of the matching. In their model, we are given a bipartite 
graph G(U, V, E). The vertices in U are known ahead of time, while the vertices in V arrive one at 
a time online in an arbitrary order. When a vertex in V arrives, the edges incident to it are revealed 
and it can be matched to a neighboring vertex in U that has not already been matched. A match 
once made cannot be revoked. The goal is to maximize the number of matched vertices. 

However, in many real world scenarios, the value received from matching a vertex might be 
different for different vertices: (1) Advertisers in online display ad-campaigns are willing to pay 
a fixed amount every time their graphic ad is shown on a website. By specifying their targeting 
criteria, they can choose the set of websites they are interested in. Each impression of an ad can be 
thought of as matching the impression to the advertiser, collecting revenue equal to the advertiser's 
bid. (2) Consider the sale of an inventory of items such as cars. Buyers arrive in an online manner 
looking to purchase one out of a specified set of items they are interested in. The sale of an item 
generates revenue equal to the price of the item. The goal in both these cases is to maximize the 
total revenue. With this background, we consider the following problem: 

Online vertex- weighted bipartite matching: The input instance is a bipartite graph 
G(U, V, E, {b u } U £jj), with the vertices in U and their weights b u known ahead of time. Vertices in 
V arrive one at a time, online, revealing their incident edges. An arriving vertex can be matched to 
an unmatched neighbor upon arrival. Matches once made cannot be revoked later and a vertex left 
unmatched upon arrival cannot be matched later. The goal is to maximize the sum of the weights 
of the matched vertices in U. 

Connection to the online budgeted allocation problem: Apart from being a natural gener- 
alization of the online bipartite matching problem, our vertex-weighted matching problem is closely 
related to an important class of online problems. Mehta et al [MSVV05J considered the follow- 
ing online version of maximum budgeted allocation problem [G KP0l( ILLN01] to model sponsored 
search auctions: We have n agents and m items. Each agent i specifies a monetary budget Bi and 
a bid bij for each item j. Items arrive online, and must be immediately allocated to an agent. If a 
set S of items is allocated to agent i, then the agent pays the minimum of Bi and X^eS^i" ^ ne 
objective is to maximize the total revenue of the algorithm. An important and unsolved restricted 
case of this problem is when all the non-zero bids of an agent are equal, i.e. bij = bi or for all j. 
This case reduces to our vertex- weighted matching problem (For a proof, refer to Appendix |A|) . 

For the general online budgeted allocation problem, no factor better than ^ (achieved by a 
simple deterministic greedy algorithm |LLN01| ) is yet known. The best known lower bound stands 
at 1 — - due to the hardness result in |KVV90) for the case when all bids and budgets are equal 
to 1 - which is equivalent to the unweighted online matching problem. The small bids case - where 
bij <C Bi for all i and j - was solved by |MSVV05[ IBJN07] achieving the optimal 1 — ~ deterministic 
competitive ratio. It was believed that handling large bids requires the use of randomization, as 
in |KW90j . In particular, many attempts |KV07[ IBM08[ IGM08] had been made to simplify the 
analysis of the randomized algorithm in |KVV90| . but no generalization had been achieved. 

Our solution to the vert ex- weighted matching problem is a significant step in this direction. Our 
algorithm generalizes that of [KVV90J and provides new insights into the role of randomization in 
these solutions, as outlined in Section Tl. 11 Finally, our algorithm has interesting connections to the 
solution of [MSVV05J for the small bids case - despite the fact that the vertex-weighted matching 
problem is neither harder nor easier than the small bids case. This strongly suggests a possible 
unified approach to the unrestricted online budgeted allocation problem. See Section [L2] for details. 



1.1 Overview of the Result 

Solution to the unweighted case: To describe our result, it is instructive to start at the un- 
weighted case (b u = 1 for all u € U) and study its solution by |KVV90| . Two natural approaches 
that match each arriving v £ V to the an unmatched neighbor in U chosen (a) arbitrarily and 
(b) randomly, both fail to achieve competitive ratio better than i. Their solution is an elegant 
randomized algorithm called RANKING that works as follows: it begins by picking a uniformly ran- 
dom permutation of the vertices in U (called the "ranking" of the vertices). Then, as a vertex in 
V arrives, it is matched to the highest-ranked unmatched neighbor. Surprisingly, this idea of us- 
ing correlated randomness for all the arriving vertices achieves the optimal competitive ratio of 1 — -. 

How do we generalize RANKING in presence of unrestricted weights b u ? The natural GREEDY al- 
gorithm which matches an arriving vertex to the highest-weighted unmatched neighbor, achieves 
a competitive ratio of ^ (see Appendix [B] for a proof). No deterministic algorithm can do bet- 
ter. While the optimality of RANKING for unweighted matching suggests choosing random ranking 
permutations of U, RANKING itself can do as badly as factor - for some weighted instances. 

The main challenge in solving this problem is that a good algorithm must follow very different 
strategies depending on the weights in the input instance. GREEDY and RANKING are both subop- 
timal for this problem, but both have ideas which are essential to its solution. In particular, they 
perform well on distinct classes of inputs, namely, GREEDY on highly skewed weights and RANK- 
ING on equal weights. The following observation about RANKING helps us bridge the gap between 
these two approaches: Suppose we perturb each weight b u identically and independently and then 
sort the vertices in the order of decreasing perturbed weights. When all the weights are equal, 
the resulting order happens to be a uniformly random permutation of U and thus, RANKING on 
unweighted instances can be thought of as GREEDY on perturbed weights! We use this insight to 
construct our solution to the vertex-weighted matching problem. While the nature of perturbation 
used did not matter in the above discussion, we need a very specific perturbation procedure for 
general vertex- weights. 

Our algorithm is defined below: 

Algorithm 1: Perturbed-Greedy 
For each u € U, pick a number x u uniformly at random from [0, 1]. 
Define the function i/)(x) := 1 — e~^ x \ 
foreach arriving v G V do 

Match v to the unmatched neighbor u £ U with the highest value of b u ip(x u ). Break ties 

consistently, say by vertex id. 



Remarks: It is not obvious, and indeed is remarkable in our opinion, that it suffices to perturb each 
weight b u completely independently of other weights. In Appendix[Cl we provide intuition as to why 
such is the case. Also, the particular form of the function ip is not a pre-conceived choice, but rather 
an artifact of our analysis. This combined with the discussion in Section [L2l seems to suggest that ijj 
is the 'right' perturbation function. We note that we can also choose the function ip(x) to be l — e~ x , 
which keeps the algorithm and results unchanged. Finally, we note that the multipliers y u = i^{x u ) 
are distributed according to the density function f(y) = j— — for y G [0, 1 — -] . Therefore, we 
could have equivalently stated our algorithm as: For each u £ U, choose a random multiplier 
y u € [0, 1 — -] from the above distribution, and use b u y u as the perturbed weight. 

Our main result is the following theorem. The second part of the theorem follows from the 
optimality of RANKING for unweighted matching [KVV90J. 

Theorem 1 PERTURBED-GREEDY achieves a competitive ratio of 1 — 1/e for the vertex-weighted 
online bipartite matching problem. No (randomized) algorithm has a better competitive ratio. 

In addition to the basic idea (from the proof of RANKING) of charging unmatched vertices in 
some probabilistic events to matched vertices in other events, our analysis needs to handle the new 



complexity introduced due to the weights on vertices. At a very high level, just like the algorithm, 
our analysis also manages to pull together the essence of the analyses of both GREEDY and RANKING. 

1.2 Implications of the Result 

Finding the optimal distribution over permutations of U: Since Perturbed-Greedy also 
chooses ranking orders through randomization, we can interpret it as a non-uniform RANKING, 
where it chooses permutations of U from the 'optimal' distribution. But we could have posed 
the following question, without the knowledge of our algorithm: How do we find an optimal non- 
uniform distribution over permutations of Ul As a start, let us consider the case of 2 x 2 graphs. By 
exhaustive search over all 2 x 2 graphs, we can figure out the best RANKING like algorithm for 2x2 
graphs (Figure [2] in Appendix |Pl shows the only two potentially 'hard' instances in 2 x 2 graphs). 
This algorithm picks the permutation (1/1,1*2) with probability j^ and the permutation (1*2,1*1) 
with probability j^— (where a = b Ul /b U2 ), and then proceeds to match to the highest neighbor. 
This algorithm gives a factor of 7— jora , which is minimized at a = 1, giving a factor of 3/4 (in 
which case the algorithm is simply the same as RANKING). 

An attempt to generalize this idea to larger graphs fails due to a blow-up in complexity. In 
general, we need a probability variable p a for every permutation a of U. The expected weight of the 
matching produced by the algorithm on a graph G, is a linear expression ALGG(p ffl ,p ff2 , ...). Thus, 
the optimal distribution over permutations is given by the optimal solution of a linear program in 
the p a variables. But this LP has exponentially many variables (one per permutation) and con- 
straints (one per "canonical graph instance"). Therefore, our algorithm can be thought of as solving 
this extremely large LP through a very simple process. 

General capacities / Matching u € U multiple times: Consider the following generalization of 
the online vertex- weighted bipartite matching problem: Apart from a weight b u , each vertex u E U 
has a capacity c u such that u can be matched to at most c u vertices in V. The capacities allow us to 
better model 'budgets' in many practical situations, e.g., in online advertising. Our algorithm easily 
handles general capacities: For each u £ U, make c u copies of u and solve the resulting instance 

with unit capacities: It is easy to verify that the solution is (l )-approximate in expectation for 

the original problem with capacities. 

Online budgeted allocation :- The single bids case vs. the small bids case: As noted 
earlier and proved in Appendix [Aj the special case of the online budgeted allocation problem with 
all the non-zero bids of an agent being equal (bij = bi or 0), reduces to our vertex- weighted matching 
problem. Since each agent provides a single bid value for all items, let us call this restriction the 
single bids case. 

Corollary 2 PERTURBED-GREEDY achieves a competitive ratio of 1 — 1/e for the single bids case 
of the online budgeted allocation problem. 

Note that the small bids case (bij <C Bi) studied in [MSVV05, BJN07] does not reduce to or 
from the single bids case. Yet, as it turns out, PERTURBED-GREEDY is equivalent to the algorithm 
of [MSVV05J - let us call it MSVV - on instances that belong to the intersection of the two cases. 
When every agent has a single small bid value, the problem corresponds to vertex-weighted matching 
with large capacities c u for every vertex u. Recall that we handle capacities on u £ U by making 
c u copies u\,U2, ■■■,u Cu of u. For each of these copies, we choose a random x Ui E [0, 1] uniformly 
and independently. In expectation, the x Ui : s are uniformly distributed in the interval [0,1]. Also 
observe that PERTURBED-GREEDY will match ui,U2,---,u 0u in the increasing order of x u -'s, if at 
all. Therefore, at any point in the algorithm, if m is the unmatched copy of u with smallest x Ui 
(and consequently highest multiplier tp(x Ui )) then x Ui is in expectation equal to the fraction of the 
capacity c u used up at that point. But MSVV uses exactly the scaling factor ip(T) where T is the 



fraction of spent budget at any point. We conclude that in expectation, PERTURBED- GREEDY tends 
to MSVV as the capacities grow large, in the single small bids case. 

It is important to see that this phenomenon is not merely a consequence of the common choice 
of function ip. In fact, the function ip is not a matter of choice at all - it is a by-product of both 
analyses (Refer to the remark at the end of Section [3]). The fact that it happens to be the exact 
same function seems to suggest that ip is the 'right' function. Moreover, the analyses of the two 
algorithms do not imply one-another. Our variables are about expected gains and losses over a 
probability space, while the algorithm in [MSVV05] is purely deterministic. 

This smooth 'interface' between the seemingly unrelated single bids and small bids cases hints 
towards the existence of a unified solution to the general online budgeted allocation problem. 

1.3 Other Related Work 

Our problem is a special case of online bipartite matching with edge weights, which has been studied 
extensively in the literature. With general edge weights and vertices arriving in adversarial order, 
every algorithm can be arbitrarily bad (see Appendix iGl) . There are two ways to get around this 
hardness: (a) assume that vertices arrive in a random order, and/or (b) assume some restriction on 
the edge weights. 

When the vertices arrive in random order, it corresponds to a generalization of the secretary 
problem to transversal matroids [BIK07] . Dimitrov and Plaxton [DP08] study a special case where 
the weight of an edge (u, v) depends only on the vertex v - this is similar to the problem we study, 
except that it assumes a random arrival model (and assumes vertex weights on the online side). 
Korula and Pal [KP09J give an g-competitive algorithm for the problem with general edge weights 
and for the general secretary problem on transversal matroids. 

If one does not assume random arrival order, every algorithm can be arbitrarily bad with general 
edge weights or even with weights on arriving vertices. [KP93J introduce the assumption of edge 
weights coming from a metric space and give an optimal deterministic algorithm with a competitive 
factor of o- As far as we know, no better randomized algorithm is known for this problem. 

Finally, there has been other recent work [DH091 IGM08| IFMMM09) . although not directly re- 
lated to our results, which study online bipartite matching and budgeted allocations in stochastic 
arrival settings. 

Roadmap: The rest of the paper is structured as follows: In Section [2] we set up the preliminaries 
and provide a warm up analysis of a proof of RANKING in the unweighted special case. Section [3] 
contains the proof of Theorem [TJ 

2 Preliminaries 

2.1 Problem Statement 

Consider an undirected bipartite graph G(U, V, E). The vertices of U, which we will refer to as the 
offline side, are known from the start. We are also given a weight b u for each vertex u G U. The 
vertices of V, referred to as the online side, arrive one at a time (in an arbitrary order). When 
a vertex v arrives, all the edges incident to it are revealed, and at this point, the vertex v can be 
matched to one of its unmatched neighbors (irrevocably) or left permanently unmatched. The goal 
is to maximize the sum of the weights of matched vertices in U. 

Let permutation n represent the arrival order of vertices in V and let M be the subset of 
matched vertices of U at the end. Then for the input (G,tt), the gain of the algorithm, denoted by 

ALG(G,n),isEueMbu- 

We use competitive analysis to analyze the performance of an algorithm. Let M*(G) be an 

optimal (offline) matching, i.e. one that maximizes the total gain for G (note that the optimal 
matching depends only on G, and is independent of 7r), and let OPT(G) be the total gain achieved 



by M*(G). Then the competitive ratio of an algorithm is minG )7r qpt/gV ■ Our goal is to devise an 
online algorithm with a high competitive ratio. 

Definition 1 (M*(G)) For a given G, we will fix a particular optimal matching, and refer to it as 
the optimal offline matching M* (G) . 

Definition 2 (u*) Given a G, its optimal offline matching M*(G) and a u S U that is matched in 
M*(G), we define u* (zV as its partner in M*{G). 

2.2 Warm-up: Analysis of Ranking for Unweighted Online Bipartite Matching 

Recall that online bipartite matching is a special case of our problem in which the weight of each 
vertex is 1, i.e. b u = 1 for all u € U. [KVV90J gave an elegant randomized algorithm for this 
problem and showed that it achieves a competitive ratio of (1 — 1/e) in expectation. In this section, 
we will re-prove this classical result as a warm-up for the proof of the main result. The following 
proof is based on those presented by |BM081 |GM08j previously. 

Algorithm 2: Ranking 

Choose a random permutation a of U uniformly from the space of all permutations. 
foreach arriving v € V do 
I Match v to the unmatched neighbor in u which appears earliest in a. 

Theorem 3 ( [KVV90] ) In expectation, the competitive ratio of RANKING is at least 1 . 

In this warm-up exercise, we will simplify the analysis by making the following assumptions: 
\U\ = \V\ = n and G has a perfect matching. These two assumptions imply that OPT = n and 
that the optimal matching M*(G) is a perfect matching. 

For any permutation a, let RANKING((j) denote the matching produced by RANKING when the 
randomly chosen permutation happens to be a. For a permutation a = (1*1,1*2, ■■■,u n ) of U, we say 
that a vertex u = Ut has rank <r(i*) = t. Consider the random variable 

( 1 If the vertex at rank i in a is matched by Ranking(cj). 

V(T ' i \ Otherwise 

Definition 3 (Qt, Rt) Qt is defined as the set of all occurrences of matched vertices in the proba- 
bility space. 

Qt = { (cr, t) : y ff<t = 1 } 

Similarly, Rt is defined as the set of all occurrences of unmatched vertices in the probability space. 

R t = { (M) : y<r,t = } 

Let Xf be the probability that the vertex at rank t in a is matched in RANKING(cr), over the 
random choice of permutation a. Then, xt = -^ and 1 — Xt = —^r- The expected gain of the 
algorithm is ALGc i7r = ^2 t xt- 

Definition 4 (o- l u ) For any a , let a l u be the permutation obtained by removing u from a and insert- 
ing it back into a at position i. 

Lemma 4 If the vertex u at rank t in a is unmatched by RANKING (a), then for every 1 < i < n, 
u* is matched in RANKING (a % u ) to a vertex v! such that cr % u {v!) < t. 



Proof: Refer to Lemma [5] in the analysis of Perturbed-Greedy for the proof of a more general 
version of this statement. □ 

In other words, for every vertex that remains unmatched in some event in the probability space, 
there are many matched vertices in many different events in the space. In the remaining part of 
this section, we quantify this effect by bounding 1 — xj, which is the probability that the vertex at 
rank t in a (chosen randomly by RANKING) is unmatched, in terms of some of the xjs. 

Definition 5 (Charging map f(a,t)) f is a map from bad events (where vertices remain un- 
matched) to good events (where vertices get matched). For each (o~,t) € Rt, 

f(<r,t) = {(o"^,s) : 1 < i < n, o~(u) = t and Ranking(<tJJ matches u* to v! where o~ % u {u') = s} 

In other words, let u be the vertex at rank t in a. Then f(a, t) contains all (a' , s), such that a' 
can be obtained from a by moving u to some position and s is the rank of the vertex to which u* , 
the optimal partner of u, is matched in a' . 

For every (cr, t) € Rt, (ft, s) € f(o~,t) implies y„- )S = 1 for some s <t. Therefore, 

U /(M) c \Jq s 

(<r,t)€Rt 8<t 

Claim 1 If (p, s) £ f(o~, t) and (p, s) E f(a, t), then a = a. 

Proof: Let v! be the vertex in p at rank s. Let u* be the vertex to which v! is matched by RANKING. 
Then it is clear from the definition of the map / that p = o~u = 0u , implying a = a. □ 

The claim proves that for a fixed t, the set-values f(o~,t) are disjoint for different a. Therefore, 

]_ V^ \Qs\ _ /^s<t x s 



\Rt\ J 

1- x t -- 



U(<r,t)eR t f( a ^) 1 LUtQ< 



n! n n\ n n\ n * — ' n\ n 

s<t 

Therefore, the probabilities xt's obey the equation 1 — Xj < ^^s<t x « f° r an ^- Since any 
vertex with rank 1 in any of the random permutations will be matched, x\ = 1. One can make 
simple arguments |KVV90[ lBM08. GM08J to prove that under these conditions, ALGg,tt = ^2t x t — 
(l ) n = (l — -) OPT, thereby proving Theorem [3l 

3 Proof Of Theorem Q] 

In this section, we will assume that \U\ = \V\ = n and that G has a perfect matching. In Appendix 
[F]we will show how this assumption can be removed. 

Recall that our algorithm works as follows: For each u G U, let o~(u) be a number picked 
uniformly at random from [0, 1] (and independent of other vertices) Now, when the next vertex 
v £ V arrives, match it to the available neighbor u with the maximum value of b u ip(o~(u)), where 
ij)(x) := l-e~^- x \ 

For ease of exposition, we will prove our result for a discrete version of this algorithm. For every 
u € U we will choose a random integer o~(u) uniformly from {1, ..., A;} where k is the parameter of dis- 
cretization. We will also replace the function ip(x) by its discrete version ip(i) = 1— (l — V\ 
The discrete version of our algorithm also matches each incoming vertex v E V to the available neigh- 
bor u with the maximum value of h u ip(a(u)). Notice that ip is a decreasing function, so ip(s) > ip(t) 
if s < t. As k — > co, the discrete version tends to our original algorithm. 

We begin with some definitions, followed by an overview of the proof. 

We will denote by a € [k] n , the set of these random choices. We will say that u is at position t 
in a if o~(u) = t. As a matter of notation, we will say that position s is lower (resp. higher) than t 
if s < t (resp. s > t). 



Definition 6 (u is matched in a) We say that u is matched in a if our algorithm matches it 
when the overall choice of random positions happens to be a. 

Let y fft t be the indicator variable denoting that the vertex at position t is matched in a. 

Definition 7 (Qt, Rt) Qt is defined as the set of all occurrences of matched vertices in the proba- 
bility space. 

Qt = {(a, t, u) : a(u) = t and y a<t = 1} 

Similarly, Rt is defined as the set of all occurrences of unmatched vertices in the probability space. 

R t = {(a, t, u) : a(u) = t and y ayt = 0} 
Let xt be the expected gain at t, over the random choice of a. Then, 



x t 



22{a,t,u)eQt ^ u 



k n v ' 

The expected gain of the algorithm is ALGc i7r = Yut x t- Also note that the optimal gain at any 
position t is B = — J*—*- since each vertex in U appears at position t with probability l/k and is 
matched in the optimal matching. Therefore, 

B _ WflA (2) 

Definition 8 (er„) For any a , a^ £ [k] n is obtained from a by changing the position of u to i, i.e. 
a u( u ) = i an d a u( u ') = a ( u ') f or Q tt u ' 7^ u - 

Observation 1 For all (o~,t,u) € Rt and 1 < i < k, our algorithm matches u* to some v! € U in 
<■ 

The above observation follows from Lemma [5j We'll use it to define a map from bad events to good 
events as follows. 

Definition 9 (Charging Map f(o~,t,u)) For every (o~,t,u) € Rt, define the set-valued map 
f(o~,t,u) = {{a l u ,s,v!) : 1 < i < k, and the algorithm matches u* to vl in a l u where cr^u') = s} 

Observation 2 If (p,s,u') € f(a,t,u), then (p,s,u') € Q s . 
Now we are ready to give an overview of the proof. 

Overview of the proof 

The key idea in the analysis of RANKING in Section 12.21 was that we can bound the number of 
occurrences of unmatched vertices - the bad events - in the entire probability space by a careful 
count of the matched vertices - the good events. The charging map / defined above is an attempt 
to do this. We'll show in Lemma [5] that if {a l u ,s,u') S f(a,t,u), then the scaled (by ip) gain due 
to v! in a l u is no less than the scaled loss due to u in a. However, s may be higher or lower than t, 
unlike RANKING where s < t. This implies that the bound is in terms of events in M Q s , 1 < s < k, 
which is very weak (as many of the events in the union are not used) . 

One idea is to bound the sum of losses incurred at all positions, thereby using almost all the 
events in (J s Q s . However, if we do this, then the charging map loses the disjointness property, i.e. 
if (a,t,u) € Rt and (a l u ,i,u) € Ri then / value of both these occurrences is the same. Thus, each 
event in \J S Q s gets charged several times (in fact a non-uniform number of times), again making 
the bound weak. To this end, we introduce the idea of marginal loss (Definition I10j) . which helps us 
define a disjoint map and get a tight bound. 

Next, we formalize the above. 



Formal proof 

We begin by proving an analogue of Lemma |4j 

Lemma 5 If the vertex u at position t in a is unmatched by our algorithm, then for every 1 < i < k, 
the algorithm matches u* in a % u to a vertex u' such that tp(t)b u < ip (tj l u (u')\ b u i . 

Proof: Case 1 (i > t): Let v\, ...,v n be the order of arrival of vertices in V. Clearly, v\ will see 
the same choice of neighbors in a l u as in a, except the fact that the position of u is higher in a l u 
than in a. Since we did not match v\ to u in a, v\ will retain its match from a even in a' l u . Now 
assuming that vi,...,vi all match the same vertex in a l u as they did in a, w/+i will see the same 
choice of neighbors in ct 1 u as in a with the exception of u. Since vi + \ did not match u in a either, it 
will retain the same neighbor in a % u and by induction every vertex from V, specifically u* keeps the 
same match in a l u as in a. Since o~(u') = a l u {u ! ), we conclude ip(t)b u < ip (c^(ti')) b u i. 

Case 2 (i < t): For a vertex v € V, let m a {v) and m a i (v) be the vertices to which v is matched in 
cr and a l u respectively, if such a match exists and null otherwise. Intuitively, since ip(i) > ip(t), the 
scaling factor of b u only improves in this case, while that of any other vertex in U remains the same. 
Therefore, we can expect u to be more likely to be matched in o~ l u and the ip {a l u {m a i (v ))) b m . r v \ > 
ip (cr (m a (v))) b mrT i v \ to hold for all v E V. In fact, something more specific is true. The symmetric 
difference of the two matchings produced by the algorithm for a and a l u is exactly one path starting 
at u that looks like (u, v±, m a (vi), V2, m a (v2), •••), where (vi,V2,—) appear in their order of 
arrival. In what follows we prove this formally. 

Let V = {v € V : m a (v) ^ m a i (v)} be the set of vertices in V with different matches in a 
and a % u . Index the members of V as v\, ■■■,vi in the same order as their arrival, i.e. v\ arrives the 
earliest. For simplicity, let Uj = m a (vj) and Wj = m a i (vj). 

We assert that the following invariant holds for 2 < j < I: Both Uj and Uj-\ are unmatched in 
a l u when Vj arrives and Vj matches Uj—i, i.e. Wj = Uj—\. 

For base case, observe that the choice of neighbors for v\ in a % u is the same as in a, except u, which 
has moved to a lower position. Since by definition v\ does not match u\ in a % ul w\ = u. Now consider 
the situation when t>2 arrives. All the vertices arriving before t>2 - with the exception of v\ - have 
been matched to the same vertex in a l u as in a, and v\ has matched to u, leaving u\ yet unmatched. 
Let U a (v2) and U a i (V2) be the sets of unmatched neighbors of V2 in a and a % u respectively at the 
moment when V2 arrives. Then from above arguments, U a i (V2) = (U a (v2) U {^i}) — {u}. Since u 
was unmatched in a, U2 7^ u. Since V2 € V', W2 7^ ^2- This is only possible if W2 = U\. And hence 
the base case is true. 

Now assume that the statement holds for j — 1 and consider the arrival of Vj. By induction 
hypothesis, v\ has been matched to u and V2, ..,Vj-i have been matched to u±, ...,Uj-2 respectively. 
All the other vertices arriving before Vj that are not in V' have been matched to the same vertex 
in a l u as in a. Therefore, Uj-\ is yet unmatched. Let U a (vj) and U a i (vj) be the sets of unmatched 
neighbors of Vj in a and a % u respectively at the moment when Vj arrives. Then from above arguments, 
U a i (vj) = (U a (vj) U {uj_i}) — {u}. Since u was unmatched in a, Uj 7^ u. Given that Wj 7^ Uj, the 
only possibility is Wj = Uj—\. Hence the proof of the inductive statement is complete. 

If u* £ V then u' = m a i (u*) = m a {u*) and the statement of the lemma clearly holds since 
a(u') = a l u (u'). If u* = v\, then u' = u and ip (o"^(ti')) b u > = tp(i)b u > tp(t)b u since i < t. Now 
suppose u* = Vj for some j > 2. Then u' = Uj-\ and by the invariant above, 

V K(</)) K> = V> K(«i-i)) b Uj _, > V K(n,)) b u . (3) 

= i>{a{ Uj ))b Uj (4) 

> ip{t)b u (5) 

Equation Q follows from the fact that u* = Vj was matched in a x u to Uj-i when Uj was also 
unmatched. The fact that only u changes its position between a and a l u leads us to (|4|). Finally, 



equation ([5]) follows from the fact that u* was matched to Uj in a when u was also unmatched. □ 
Using the above lemma, we get the following easy observation. 

Observation 3 For all (a,t,u) E Rt, 1 <t <k, f(a,t,u) contains k values. 

Remark: As noted in the overview, although Lemma [5] looks very similar to Lemma [IJ it is 
not sufficient to get the result, since the good events pointed to by Lemma [5] are scattered among 
all positions 1 < s < k - in contrast to Lemma 0J which pointed to only lower positions s < t, 
giving too weak a bound. We try to fix this by combining the losses from all Rt. However we 
run into another difficulty in doing so. While for any fixed t, the maps f(a, t, u) are disjoint for 
all (a,t,u) £ Rt, but the maps for two occurrences in different Rts may not be disjoint. In fact, 
whenever some u is unmatched in a at position t, it will also remain unmatched in o~u for j > t, 
and the sets f{a,t,u) and f(crh,j,u) will be exactly the same! This situation is depicted in Figure 
[3] in Appendix [El 

This absence of disjointness again renders the bound too weak. To fix this, we carefully select a 
subset of bad events from (L Rt such that their set-functions are indeed disjoint, while at the same 
time, the total gain/loss can be easily expressed in terms of the bad events in this subset. 

Definition 10 (Marginal loss events St) Let St = {(a,t,u) G Rt : (c^ _1 ,t — l,u) £ Rt-l}> 

where Rq = 0. 

Informally, St consists of marginal losses. If u is unmatched at position t in a, but matched at 
position t— 1 in a l ~ x , then (o~,t,u) € St (See Figure [3] in Appendix [E]). The following property can 
be proved using the same arguments as in Case 1 in the proof of Lemma [5) 

Observation 4 For (o~,t,u) € St, u is matched at i in a l u if and only if i < t. 

Definition 11 (Expected Marginal Loss a t ) 

Expected marginal loss at position t = at = — — (6) 

Claim 2 

Vi; x t = B~Y,a s (7) 

s<t 

Total loss = ^(B -x t ) = ^ (k - t + l)a t (8) 

t t 

Proof: To prove equation ([7]), we will fix a t and construct a one-to-one map g : Rt — > Us<f5t- 
Given (a,t,u) £ Rt, let i be the lowest position of u such that u remains unmatched in a l u . By 
observation [31 i is unique for (a,t,u). We let g(a,t,u) = {a' l u ,i,u). Clearly, (cr*, i, u) € Si. To 
prove that the map is one-to-one, suppose (p, s, u) = g(o~, t, u) = g(a, t, u). Then by definition of g, 
p = o- s u =~o s u which is only possible if a = a. Therefore, \R t \ = \J s<t St- 

Lastly, observe that g maps an element of Rt corresponding to the vertex u being unmatched, 
to an element of Si corresponding to the same vertex u being unmatched. From equation ([2]), 

z2(a,t,u)eRt ^u s-^ zZ(ffi,i,u)eSi ^u s-^ 

B — x t = " T- = > , = > OLi 

k n ^ k n *-J 

i<t i<t 

This proves equation (|7|). Summing (|7|) for all t, we get ([8]). D 

Now consider the same set-valued map / from Definition [9l but restricted only to the members 
of \J t S t . We have: 



Claim 3 For (a,t,u) € St and (a,t,u) € Sj, if (p,s,u') £ f(a,t,u) and (p,s,u') € f(a,t,u) then 
a = a, t = t and u = u. 

Proof: If v! is matched to v in p then by definition of /, v = u* = u*, implying u = u. Therefore, 
p = a l u = a l u for some i. But this implies that a = a 3 u for some j. This is only possible for j = t since 
by definition, if u is unmatched in a at t, then there exists a unique i for which (cr^,i,ri) € Ut^t- 
If j = t, then a = a and t = t. □ 

Armed with this disjointness property, we can now prove our main theorem. 



Theorem 6 As k — )• oo, 



(9) 



J> > (l-±)oPT(G) 

Proof: Using Lemma [5] and Observation [3l we have for every (o~,t,u) € St, 

If we add the equation (I10p for all (<r, t, u) € St and for all 1 < t < n, then using Claim [3] and 
Observation [21 we arrive at 

i i 



i 



* v *<* / 



A; 

Equation (jlip follows from ([6]) and (JTJ) . Equation (|12p uses Claim [2j 
We now rearrange terms to get 



(12) 



E <*{«*) + %^) *#£««) p») 



fc 7 - A; 

When ^(t) = 1 - (1- i)^ 4+1 , observe that ^(t) + Es ^ V ' W > ^±±1 and £t^(*) = | as 
k —?■ oo. Using Claim [2j 

Total loss = ^ (B - x t ) = ^(Jfe - i + l)a t 

t 

= as k — > oo 

e 

OPT(G) 



Hence, as fc — > oo, 



H) 



Total gain > 1 - - OPT(G) 



Remark: Observe that we substituted for ip(t) only after equation (f!3j) - up until that point, any 
choice of a non-increasing function ip would have carried the analysis through. In fact, the chosen 
form of ip is a result of trying to reduce the left hand side of equation (|13p to the expected total 
loss. To conclude, the 'right' perturbation function is dictated by the analysis and not vice versa. 

□ 
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A The Reduction from Online Budgeted Allocation with Single 
Bids 

In this section, we will show that the single bids case of the online budgeted allocation problem 
reduces to online vert ex- weighted bipartite matching. Let us first define these problems. 

Online budgeted allocation: We have n agents and m items. Each agent i specifies a 
monetary budget Bi and a bid bij for each item j. Items arrive online, and must be immediately 
allocated to an agent. If a set S of items is allocated to agent i, then the agent pays the minimum 
of Bi and X/?e5^y- The objective is to maximize the total revenue of the algorithm. 

SINGLE BIDS CASE: Any bid made by agent i can take only two values: bi or 0. In other words, 
all the non-zero bids of an agent are equal. 

Claim 4 Online budgeted allocation with single bids reduces to online vertex-weighted bipartite 
matching. 

Proof: Given an instance of online budgeted allocation where agent i has budget Bi and single bid 
value bi, we will construct an input instance G(U, V, E, {b u } U £jj) of online vertex-weighted bipartite 
matching. The set V consists of one vertex corresponding to every item. The set U will contain 
one or more vertices for every agent. 

For every agent i, let n« be the largest integer such that n^ftj < Bi and let n = Bi — nibi. Clearly, 
Ti < bi. We will construct a set Ui of rii vertices, each with weight bi. In addition, if n > 0, then 
we will construct a vertex Ui with weight ri and add it to Ui. For all u € Ui and v E V, the edge 
uv € E if and only if agent i makes a non-zero bid on the item corresponding to v. 

(1) Given a solution to the budgeted allocation problem where a set Si of items is allocated to agent 
i, let us see how to construct a solution to the vertex- weighted matching problem with the same 
total value. 

• If agent i pays a total of \Si\ ■ bi, then we know that \Si\ < rij. Hence, for every item in Si, 
we will match the corresponding vertex in V to a vertex in Ui — {ui}. Let Ri be the set of 
vertices in Ui thus matched. We have: 

2 b u = \Ri\ ■ bi = \Si\ ■ bi 

• If agent i pays a total amount strictly less than \Si\ ■ bi, then we know that: (a) \Si\ > n, + 1, 
(b) r i > and (3) agent i pays the budget Bi . We can now choose any rii + 1 items in Si and 
match the corresponding vertices in V to the n, + 1 vertices in Ui . The sum of the weights of 
matched vertices in Ui, X^ugC/ °u = B^. 

Summing over all i, the weight of the matching formed is equal to the total revenue of the budgeted 
allocation. Let OPT^ and OPT^ denote the values of the optimal solutions of the budgeted 
allocation and the vertex-weighted matching problems respectively. Then we conclude from the 
above discussion that: 

OPT M >OPT A (14) 

(2) Given a solution to the vertex-weighted matching problem where a set R C U of vertices is 
matched, let us see how to construct a solution to the budgeted allocation problem with at least 
the same total value. Let Ri = RDUi. For every v € V that is matched to a vertex in Ri, we will 
allocate the corresponding item to agent i. Let Si be the set of items allocated to agent i. 



• If \Ri\ = \Si\ < rii, then agent i pays a total of \Si\ -bi and we have: 

^2 K < \Ri\ ■ h = \Si\ ■ hi 

• If on the other hand, \Ri\ = \Si\ = rn + 1 then agent i pays a total of B{ and we have: 

^2 h u = ^^K = Bi 
uGRi u<=Ui 

Summing over all i, the total revenue of the budgeted allocation is at least the weight of the matching. 
Let ALGm be the expected weight of the vertex-weighted matching constructed by PERTURBED- 
GREEDY and ALG^ be the expected value of the budgeted allocation constructed using the above 
scheme. From the above discussion, we conclude: Therefore, 



ALG A > ALG 



> 



M 

1 _ l\ 0PT M 



(15) 



> ( 1 - - ] OPT A 



Here, equation (fT5|) follows from the main result - Theorem Q]- and the last step uses equation 
14|) . This completes our proof. 

□ 



B Performance of GREEDY and RANKING 

With non-equal weights, it is clearly preferable to match vertices with larger weight. This leads to 
the following natural algorithm. 



Algorithm 3: Greedy 



foreach arriving v £ V do 

Match v to the unmatched neighbor in u which maximizes b u (breaking ties arbitrarily); 

It is not hard to show that GREEDY achieves a competitive ratio of at least g- 
Lemma 7 Greedy achieves a competitive ratio of 1/2 in vertex-weighted online bipartite matching. 

Proof: Consider an optimal offline matching, and a vertex u £ U that is matched in the optimal 
offline matching but not in the greedy algorithm. Now look at a vertex u* £ V that is matched to 
the vertex u in the optimal matching. In GREEDY, u* must have been matched to a vertex u £ U, 
s.t. b u < b i, since u was unmatched when u* was being matched. So we'll charge the loss of b u 
to u . Note that each u does not get charged more than once - it is charged only by the optimal 
partner of its partner in the algorithm's matching. Thus the loss of the algorithm is no more than 
the value of the matching output by the algorithm. Hence the claim. □ 

In fact, this factor 1/2 is tight for GREEDY as shown by an instance consisting of many copies 
of the following gadget on four vertices, with ux,U2 £ U and v\,V2 £ V. As e — > 0, the competitive 
ratio of GREEDY tends to \. 



b u , = 1 



b„.> = 1 




Ui 



Vi 





Graph edge 
Matching edge 



U 2 V2 

Optimal 

matching 



«2 v 2 

GREEDY 
matching 



Notice that this counter-example relies on weights being roughly equal. We, however, know that 
RANKING has an expected competitive ratio of (1 — 1/e) when the weights are equal. On the other 
hand, if the weights are very different, i.e. e is large, in the above example, then GREEDY provides 
a good competitive ratio. At the same time, if we exchanged the weights on the two vertices in the 
example to be b Ul = 1 and b U2 = 1 + e, then as e grows large, the expected competitive ratio of 
RANKING drops to -^ and on larger examples, it can be as low as — . To summarize, GREEDY tends 
to perform well when the weights are highly skewed and RANKING performs well when the weights 
are roughly equal. 



C Intuition Behind the Sufficiency of Independent Perturbations 

Recall that our algorithm perturbs each weight b u independent of the other weights. The fact that 
Perturbed-Greedy achieves the best possible competitive ratio is a post-facto proof that such 
independence in perturbations is sufficient. Without the knowledge of our algorithm, one could 
reasonably believe that the vector of vertex-weights {b u } U £u - which is known offline - contains 
valuable information which can be exploited. In what follows we provide intuition as to why this is 
not the case. 

Consider the two input instances in Figure [TJ Both the connected components in G\ have equal 
weights, and hence we know that RANKING achieves the best possible competitive ratio on G\. 
Similarly, both connected components in Gi have highly skewed weights, suggesting GREEDY as 
the optimal algorithm. On the other hand, RANKING and GREEDY are far from optimal on G2 and 
G\ respectively. Since two instances with identical values of vertex-weights require widely differing 
strategies, this exercise suggests that we may not be losing must information by perturbing weights 
independently. The optimality of our algorithm proves this suggestion. 
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Graph G\ 
Equal weights =$> RANKING 



Graph G2 
Skewed weights =$> GREEDY 



Figure 1: Two instances with the same vertex- weights, but widely differing optimal strategies. 



D Hard Instances in 2 x 2 Graphs 

Figure [2] shows the only two potentially 'hard' instances in 2 x 2 graphs. On all other instances, the 
optimal matching is found by any reasonable algorithm that leaves a vertex v € V unmatched only 
if all its neighbors are already matched. 
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Figure 2: Canonical examples for 2x2 graphs. 
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F Graphs with Imperfect Matchings 

In Section [3l we proved Theorem Q] for graphs G(U,V,E) such that \U\ = \V\ and G has a perfect 
matching. We can remove these assumptions with just a few modifications to the definitions and 
equations involved in the proof. The algorithm remains unchanged, i.e. we just use PERTURBED- 
GREEDY. We will only outline these modifications and the rest of the proof follows easily. Let 
M*(G) be a maximum weight matching in G(U,V, E) and U be the set of vertices in U matched 
by M*(G). Thus we know that OPT(G) = E ue F 6 «- 

Keeping the definition of Qt the same, we change the definition of Rt to: 

Rt = {(a,t,u) : u € U and a{u) = t and y ffit = 0} 

The above redefinition conveys the fact that if a vertex u is not matched by M*(G), then we no 
longer consider u being unmatched a bad event. Consequently, equation ([2]) changes to: 



Z-^(<J,t,U 



(cr,t,u)eRt 



B-x t < 

which in turn yields following counterpart of equation ([7J : 



Vi, 



xt > B-^2 



rt.s 



(16) 



s<t 



Let Eq(t) be the version of (fT6|) for t. We then multiply Eq(t) by ip(t) — ifj(t + 1) and sum over 
1 < t < n to obtain a combined inequality (with ip(k + 1) =0): 



£>(*) - rl>(t + l))xt > f/;(l)B - J^^(t) 



at 



5D^(*)at > ^(1) 



OPT(G) 

k 



(1-^t + i)) 

2^ s x * 



(17) 



Equation (|17p used the definition of tp(t) = 1 — (l — r) ■ Combining equation (|17p with 

(fTTD . we get: 



1^, M ,, ,OPT(G) v-^(l-V(i + l)) 



> ( 1-- ) op-riCri 



as k ->• oo, since ^(1) ->■ (l - ±) and ^>(t) - ^(< + 1) = (1 ~^ +1) -+ as k 



oo. 



G A Lower Bound for Randomized Algorithms with Edge Weights 

In this section, we will sketch the proof of a lower bound for the competitive ratio of a randomized 
algorithm, when the graph G(U, V, E) has edge weights and our objective is to find a matching in 
G with maximum total weight of edges. Previous studies of this problem have only mentioned that 
no constant factor can be achieved when the vertices in V arrive in an online manner. However, we 
have not been able to find a proof of this lower bound for randomized algorithms in any literature. 
We prove the result when the algorithm is restricted to be scale-free. A scale-free algorithm in this 
context produces the exact same matching when all the edge weights are multiplied by the same 
factor. 

Consider a graph G(U, V, E) such that U contains just one vertex u and each vertex in v E V has 
an edge to u of weight b v . Fix vi, v%, ... to be the order in which the vertices of V arrive online. By 
Yao's principle, it suffices for us to produce a probability distribution over b vl ,b V2 , ... such that no 
deterministic algorithm can perform well in expectation. We will denote the vector of edge weights in 
the same order in which the corresponding vertices in V arrive, i.e. (b vl ,b V2 , ...) and so on. Consider 
the following n vectors of edge weights: For every 1 < i < n, bj = (D l , D l+1 , ..., D n ,0, 0, ...) and 
so on, where D is a sufficiently large number. Suppose our input distribution chooses each one of 
these n vectors of edge weights with equal probability. 

Clearly, regardless of the vector which is chosen, OPT(G) = D n . Since an algorithm is assumed 
to be scale-free and online, it makes the exact same decisions after the arrival of first k vertices for 
each of the edge weight vectors hj, 1 < j < k. Therefore, it cannot distinguish between bi,...,bfc 
after just k steps. Hence, we can characterize any algorithm by the unique k such that it matches 
the k'th. vertex in V with a positive weight edge. 

Let ALG be any deterministic algorithm that matches the fe'th incoming vertex with a positive 

weight edge to u. Then the expected weight of the edge chosen by ALG is — > D % . Since D is 

n ^— ' 

i>k 

large, this is at most -OPT(G), where c is some constant. Applying Yao's principle, we conclude 
that the competitive ratio of the best scale-free randomized algorithm for online bipartite matching 
with edge weights is O (-) . 



